Speaker Identification based on Hybrid Clustering and Radial Basis Function

نویسندگان

  • Yap Teck Ann
  • Mohd Shafry Mohd Rahim
  • Ayman Altameem
  • Amjad Rehman
  • Ismail Mat Amin
  • Tanzila Saba
  • Salman Abdul Aziz
  • Salman bin Abdul Aziz
چکیده

Speaker identification is the computing task to identify an unknown identity based on the voice. A good speaker identification system must have a high accuracy rate to avoid invalid identity. Despite of last few decades’ efforts, accuracy rate in speaker identification is still low. In this paper, we propose a hybrid approach of unsupervised and supervised learning i.e. subtractive clustering and radial basis function(Sub-RBF).The proposed fused technique yields promising results because subtractive clustering is able to solve the initial guesses of cluster center and difficulty level to determine the number of cluster. Besides that, RBF has simple network structure and faster learning algorithm. In RBF input to output map uses the local approximations which will combine the linear approximations and causes the linear combinations with less weight. RBF neural network model uses subtractive clustering algorithm to select the hidden node centers for high training speed. In the meantime, the RBF network is trained with a regularization term so as to minimize the variances of the nodes in the hidden layer and to perform accurate prediction. Promising results are achieved to identify speaker using proposed fused approach. [Ann Y. T, Rahim M.S.M., Altameem A, Rehman A, Amin, I, M. Saba T. Speaker Identification based on Hybrid Clustering and Radial Basis Function. J Am Sci 2012;8(10):71-75]. (ISSN: 1545-1003). http://www.jofamericanscience.org. 12

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Combination of Subtractive Clustering and Radial Basis Function in Speaker Identification

Speaker identification is the process of determining which registered speaker provides a given utterance. Speaker identification required to make a claim on the identity of speaker from the Ns trained speaker in its user database. In this study, we propose the combination of clustering algorithm and the classification technique – subtractive and Radial Basis Function (RBF). The proposed techniq...

متن کامل

Speaker and Gender Identification using Multilingual Speech

As the demand for multilingual speaker recognizers increases, the development of systems which combine automatic speaker and gender identification, models becomes increasingly important. In this work a speaker and gender identification system is developed using multilingual speech signal as input. MFCCs and delta-MFCCs, LPC, LPCC , Formants ,ZCR are used to build modal for classification and to...

متن کامل

Nonlinear System Identification Using Rbf Networks with Linear Input Connections

This paper presents a modified RBF network with additional linear input connections together with a hybrid training algorithm. The training algorithm is based on kmeans clustering with square root updating method and Givens least squares algorithm with additional linear input connections features. Two real data sets have been used to demonstrate the capability of the proposed RBF network archit...

متن کامل

Hybrid networks based on RBFN and GMM for speaker recognition

In this paper, a hybrid network based on the combination of Radial Basis Function Networks (RBFNs) and Gaussian Mixture Models (GMMs) is proposed and used for speaker recognition. The hybrid network is a hierarchical one, where a GMM is built for each speaker and an RBFN is built for each group of speakers. The GMMs and RBFNs are trained independently. The RBFNs are used as a rst stage coarse c...

متن کامل

Hybrid Network Based on Rbfn and Gmm for Speaker Recognition

In this paper, a hybrid network based on the combination of Radial Basis Function Networks (RBFNs) and Gaussian Mixture Models (GMMs) is proposed and used for speaker recognition. The hybrid network is a hierarchical one, where a GMM is built for each speaker and an RBFN is built for each group of speakers. The GMMs and RBFNs are trained independently. The RBFNs are used as a rst stage coarse c...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013